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Summary 

By a series of simple examples, we illustrate how the lack of mathematical concern can 
readily lead to surprising mathematical contradictions in wave mechanics. The basic math- 
ematical notions allowing for a precise formulation of the theory are then summarized and 
it is shown how they lead to an elucidation and deeper understanding of the aforementioned 
problems. After stressing the equivalence between wave mechanics and the other formu- 
lations of quantum mechanics, i.e. matrix mechanics and Dirac's abstract Hilbert space 
formulation, we devote the second part of our paper to the latter approach: we discuss the 
problems and shortcomings of this formalism as well as those of the bra and ket notation 
introduced by Dirac in this context. In conclusion, we indicate how all of these problems 
can be solved or at least avoided. 

"'^This work was supported by the Alexander von Humboldt Foundation while the author was on leave of 
absence at the Institut fiir Theoretische Physik of the University of Gottingen. The revised version of the 
text was prepared during a sabbatical leave spent at the Institut fiir Theoretische Physik of the Technical 
University of Vienna. 

^I dedicate these notes to the memory of Tanguy Altherr who left us very unexpectedly in the mountains 
which he liked so much (and where I could do some nice trips with him). As to the subject of these notes 
(which I had the pleasure to discuss with him), Tanguy appreciated a lot Dirac's formalism and certainly 
knew how to apply it to real physical problems (like the ones he worked on with an impressive enthusiasm, 
energy and productivity) . Even if he did not share my preoccupations in this field, he liked to discuss the 
problems related to the general formalism of quantum physics. 
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A There is no surprise 



Chapter 1 
Introduction 



Let us first provide an overview of tlie present article. Tlie first part, covering section 2 and 
some complements gathered in the appendix, deals with mathematical surprises in quantum 
theory. It is accessible to all readers who have some basic notions in wave mechanics. The 
second part, consisting of section 4, deals with Dirac's abstract Hilbert space approach and 
his bra and ket notation; it is devoted to a critical study of this formalism which has become 
the standard mathematical language of quantum physics during the last decades. Both 
parts are related by section 3 which discusses the equivalence between wave mechanics and 
the other formulations of quantum mechanics. As a consequence of this equivalence, the 
mentioned surprises also manifest themselves in these other formulations and in particular 
in Dirac's abstract Hilbert space approach. In fact, one of the points we want to make 
in the second part, is that the lack of mathematical concern which is practically inherent 
in Dirac's symbolic calculus, is a potential source for surprises and that it is harder, if 
not impossible, to elucidate unambiguously all mathematical contradictions within this 
formalism. 

Let us now come to the motivation for our work and present in greater detail the outline 
of our discussion. 

Mathematical surprises 

In the formulation of physical theories, a lack of mathematical concern can often and 
readily lead to apparent contradictions which are sometimes quite astonishing. This is 
particularly true for quantum mechanics and we will illustrate this fact by a series of simple 
examples to be presented in section 2.1. In the literature, such contradictions appeared in 
the study of more complicated physical phenomena and they even brought into question 
certain physical effects like the Aharonov-Bohm effect [|1|. These contradictions can only 
be discarded by appealing to a more careful mathematical formulation of the problems, 
a formulation which often provides a deeper physical understanding of the phenomena 
under investigation. Therefore, we strongly encourage the reader to look himself for a 
solution of all problems, eventually after a reading of section 2.2 in which we review the 
the mathematical formalism of wave mechanics. In fact, the latter review also mentions 
some mathematical tools which are not advocated in the majority of quantum mechanics 
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textbooks, though they are well known in mathematical physics. The detailed solution of 
all of the raised problems is presented in the appendix. Altogether, this discussion provides 
an overview of the subtleties of the subject and of the tools for handling them in an efficient 
manner. 

Dirac's formalism 

Since all of our examples of mathematical surprises are formulated in terms of the 
language of wave mechanics, one may wonder whether they are also present in other for- 
mulations of quantum theory. In this respect, we recall that there are essentially three 
different formulations or 'representations' that are used in quantum mechanics for the de- 
scription of the states of a particle (or of a system of particles): wave mechanics, matrix 
mechanics and the invariant formalism. The first two rely on concrete Hilbert spaces, the 
last one on an abstract Hilbert space. In general, the latter formulation is presented using 
the bra and ket notation that Dirac developed from 1939 on, and that he introduced in the 
third edition of his celebrated textbook on the principles of quantum mechanics 0. Let 
us briefiy recall its main ingredients which will be discussed in the main body of the text: 



• {l^)}„eN and {\x)}^^^ 

• \ni,n2, ...) associated with a complete system of commuting observables {Ai, A2, ...} . 

This notation usually goes together with a specific interpretation of mathematical opera- 
tions given by Dirac. 

In this respect, it may be worthwhile to mention that Dirac's classic monograph (and 
thereby the majority of modern texts inspired by it) contains a fair number of statements 
which are ambiguous or incorrect from the mathematical point of view: these points have 
been raised and discussed by J.M.Jauch ||^. The state of affairs can be described as 
follows [^: "Unfortunately, the elegance, outward clarity and strength of Dirac's formalism 
are gained at the expense of introducing mathematical fictions. [...] One has a formal 
'machinery' whose significance is impenetrable, especially for the beginner, and whose 
problematics cannot be recognized by him." Thus, the verdict of major mathematicians 
like J.Dieudonne is devastating [^]: "When one gets to the mathematical theories which 
are at the basis of quantum mechanics, one realizes that the attitude of certain physicists 
in the handling of these theories truly borders on the delirium. [...] One has to wonder 
what remains in the mind of a student who has absorbed this unbelievable accumulation 
of nonsense, a real gibberish! It should be to believe that today's physicists are only at 
ease in the vagueness, the obscure and the contradictory." Certainly, we can blame many 
mathematicians for their intransigence and for their refusal to make the slightest effort to 
understand statements which lack rigor, even so their judgment should give us something 
to think about. By the present work, we hope to contribute in a constructive way to this 
reflection. 
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In section 3, we will define Dirac's notation more precisely while recalling some mathe- 
matical facts and evoking the historical development of quantum mechanics. In section 4, 
we will successively discuss the following questions: 

1. Is any of the three representations to be preferred to the other ones from the mathe- 
matical or practical point of view? In particular, we discuss the status of the invariant 
formalism to which the preference is given in the majority of recent textbooks. 

2. What are the advantages, inconveniences and problems of Dirac's notations and of 
their interpretation? (The computational rules inferred from these notations and 
their interpretation are usually applied in the framework of the invariant formal- 
ism and then define a symbolic calculus - as emphasized by Dirac himself in the 
introduction of his classic text 0.) 

To anticipate our answer to these questions, we already indicate that we will reach 
the conclusion that the systematic application of the invariant formalism and the rigid use 
of Dirac's notation - which are advocated in the majority of modern treatises of quantum 
mechanics - are neither to be recommended from a mathematical nor from a practical point 
of view. Compromises which retain the advantages of these formalisms while avoiding their 
shortcomings will be indicated. The conclusions which can be drawn for the practice and 
for the teaching of quantum theory are summarized in the final section (which also includes 
a short guide to the literature). 
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Chapter 2 
Wave mechanics 



2.1 Mathematical surprises in quantum mechanics 

Examples which are simple from the mathematical point of view are to be followed by 
examples which are more sophisticated and more interesting from the physical point of 
view. All of them are formulated within the framework of wave mechanics and in terms of 
the standard mathematical language of quantum mechanics textbooks. The theory of wave 
mechanics being equivalent to the other formulations of quantum mechanics, the problems 
we mention are also present in the other formulations, though they may be less apparent 
there. The solution of all of the raised problems will be implicit in the subsequent section 
in which we review the mathematical formalism of wave mechanics. It is spelled out in 
detail in the appendix (but first think about it for yourself before looking it up!). 

Example 1 

For a particle in one dimension, the operators of momentum P and position Q satisfy 
Heisenberg's canonical commutation relation 

[P,Q] = yl . (2.1) 
By taking the trace of this relation, one finds a vanishing result for the left-hand side, 

h 

Tr [P, Q] = 0, whereas Tr ( — 1) 7^ 0. What is the conclusion? 
Example 2 

Consider wave functions ei ip which are square integrable on R and the momentum 

h d „ 
operator P = — —. Integration by parts yieldsg 

1 QjX 

/+00 p-\-oo f) 

dx^{x){P^){x) = / dx{Pv){x)i,{x) + ^[{^i,){x)tZ ■ 
-00 J —CO 1 

^The complex conjugate of z G C is denoted by z or z*. 
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Since Lp and i}) are square integrable, one usually concludes that these functions vanish for 
X ±00. Thus, the last term in the previous equation vanishes, which implies that the 
operator P is Hermitian. 

However, the textbooks of mathematics tell us that square integrable functions do, 
in general, not admit a limit for x ±cxd and therefore they do not necessarily vanish 
at infinity. There are even functions which are continuous and square summable on R 
without being bounded at infinity @: an example of such a function is given by /(x) = 
x^exp (— x^sin^x), cf. figure 2.1 where the period of the function has been multiplied by 
a factor 20 in order to increase the number of oscillations. 

5 : 

4 : 

3 f 

2 '- 




Figure 2.1: Graph of /(x) = x^ exp [— x^ sin^ (20x)] 



We note that this example essentially amounts to a refinement of the following better known 

example of a function which is continuous, positive and integrable on R, though it does 

not tend to zero for x ±00 (cf. figure 2.2): consider the "shrinking comb function" 

2 

fix) = J2'^=2 fnix) where /„ vanishes on R, except on an interval of width — centered at 
n, where the graph of /„ is a triangle, which is symmetrical with respect to n and of height 
1. 




Figure 2.2: Shrinking comb function 



The area of this triangle being — , we have 



9 ■ 



r+oo 00 I 

/ dxf{x) = J2 — < ^ 

J-00 ^ 
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but the function / does not tend to zero for x +00. 

Can one conclude that the operator P is Hermitian in spite of these facts and if so, why? 
Example 3 

h d 

Consider the operators P = — — and 'Q = multiplication by x' acting on wave 

i ax 

functions depending on a; G R. Since P and Q are Hermitian operators, the operator 
A = PQ^ + Q^P also has this property, because its adjoint is given by 

= (Pg3 ^ gSp^t _ g3p ^ pg3 _ ^ 

It follows that all eigenvalues of A are real. Nevertheless, one easily verifies that 

^ r . , ^ I —1= kl"^/^ exp ( for a; 7^ ^, 

Af = -f with fix) =<^ 72' ' ^ UxV ^ (2.2) 
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for a; = 



which means that A admits the complex eigenvalue h/i. Note that the function / is 
infinitely different iable on R and that it is square integrable, since 



00 



/"OO P CXD 

dx\f{x)\^ = 2 / dx\f{x)\' = / rfa;x-3e-i/(2.2) = \e-'/^^^"^ 
Jo Jo L 

Where is the error? 
Example 4 

Let us consider a particle confined to the interval [0, 1] and described by a wave function 
ip satisfying the boundary conditions ipi^) = = ipil). Then the momentum operator P = 

— ;— is Hermitian, since the surface term appearing upon integration by parts vanishes: 
i dx 

C dx (rp (Pi;) - (P^) ^) (x) = y [(^^) ix)]l = . (2.3) 

Since P is Hermitian, its eigenvalues are real. In order to determine the latter, we note 
that the eigenvalue equation, 

{Pipp){x) = pipp{x) {peR , ipp^O) , 

is solved by tpp{x) = Cp exp {—px) with Cp G C— {0}. The boundary condition ipp{0) = now 

implies ipp = 0, therefore P does not admit any eigenvalues. Nevertheless, the spectrum 
of P is the entire complex plane 0] and P does not represent an observable. How can one 
understand these results which seem astonishing? 

Example 5 
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If one introduces polar coordinates in the plane or spherical coordinates in space, then 

the polar angle (y? and the component of angular momentum are canonically conjugate 

variables in classical mechanics. In quantum theory, the variable tp becomes the operator 

Ti d 

of 'multiplication of the wave function ip{'^) by and = — which implies the 
commutation relation 

[L,,^] = ^l ■ (2.4) 

These operators acting on periodic wave functions (i.e. ip{0) = iIj{2tt)) are Hermitian. 
Furthermore, Lz admits a complete system of orthonormal eigenfunctions ipm, 

Lz^m = mhipm with ^m{f) = — cxp {iunp) and m G Z . (2.5) 

V 27r 

(For the wave functions ip, we only specify the dependence on the angular variable cp and 
for the orthonormalisation, we refer to the standard scalar product for square integrable 
functions on the interval [0, 27r) : 

2ir 



(^1,^2) = / dip ijjiiip) llj2i^) •) 

Jo 

By evaluating the average value of the operator [Lz, v?] in the state ipm B and by taking 
into account the fact that Lz is Hermitian, one finds that 

= {Lllfjmjipipm) -mn{lp^,ipiljm) (2.6) 

= {mh - mh) {ipm, <P ^m) = . 

There must be a slight problem somewhere... (We note that similar problems appear for 
the phase and number operators which are of interest in quantum optics |TD[-) 

Example 6 

Let us add a bit to the confusion of the previous example! In 1927, Pauli noted that the 
canonical commutation relation ( |2.1| ) implies Heisenberg's uncertainty relation AP ■ AQ > 



— by virtue of the Cauchy-Schwarz inequality. Since the commutation relation ( p.4|) has 



the same form as ( |2.1D , one can derive, in the same way, the uncertainty relation 

AL,-Aip>^ . (2.7) 



The following physical reasoning shows that this inequality cannot be correct 11 



One can always find a state for which ALz < h/An and then the uncertainty for the angle 
ip has to be larger than 2tt, which does not have any physical sense, since (p takes values in 
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the interval [0,27r). How is it possible that relation (|2.4| ) is correct, though the conclusion 
Q is not? 

By the way, this example shows that the uncertainty relation AA-AB > -\ {[A, B]) \ for 

any two observables A and B (whose derivation can be found in most quantum mechanics 
texts) is not valid in such a generality. 

Example 7 

Let us consider a particle of mass m in the infinite potential well 



V{x) 



if |x| < a (a > 0) 
oo otherwise . 



The Hamiltonian for the particle confined to the inside of the well is simply H 



2m dx^ 
Let 

i^ix) = — TTTT (a^ — x^) for |a;| < a (and ipix) = otherwise) (2.8) 

be the normalized wave function of the particle at a given time. Since H'^ip = — — - = 0, 

4m^ dx^ 

the average value of the operator in the state vanishes : 



{H')^ = (V^, H'ij) = / dx ^Pix) {H',p){x) = . (2.9) 

J —a 

This average value can also be determined from the eigenvalues and eigenf unctions of if, 

Hip^ = Enif^ with En = - ^n^ (n=l,2,...) , (2.10) 

by applying the formula 

oo 

{H^)^ = J2Elpn with p„ = . (2.11) 

n=l 

Proceeding in this way, one definitely does not find a vanishing result, because E^ > and 

15^4 

< < 1, Z^^iPn = 1- In fact, the calculation yields (H"^)^ = ^— r. Which one of 

these two results is correct and where does the inconsistency come from? M 



2.2 Clues to the understanding: the mathematical 
formalism 

The problems and contradictions presented in the previous subsection can only be eluci- 
dated by resorting to a more precise mathematical formulation. Therefore, we now sum- 



marize the basic mathematical tools H, |8llflT^l-flT8i of wave mechanics in three subsections. 
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The first two are sufficient for tackfing the raised problems (and actually provide already 
some partial solutions); the last one has been added in order to round up the discussion 
and to provide a solid foundation for the subsequent section 4. 

We consider the motion of a particle on a straight line parametrized by x G R. The 
generalization to a bounded interval, to three dimensions, to the spin or to a system of 
particles does not present any problems. 



2.2.1 The space of states 

Born's probabilistic interpretation of the wave function tjj requires that 



/ \'ilj(x,t)\'^dx = 1 for all t 



e R 



Thus, the wave function which depends continuously on the time parameter t, must be 
square integrable with respect to the space variable x: 

ijj : R X R ^ C 

{x,t) I— > il){x,t) = il!t{x) with ipt E L'^{Il,dx) . 

Here, I/^(R, dx) is the space of square integrable functions, 

L'^{R,dx)^{f -.R^C I / dx\f{x)\'^ <oo} , 

with the scalar product 

{f,9)L^= I dxJix)g{x) for f,geL'{R,dx) . (2.12) 
Jr 

For later reference, we recall that this space is related by the Fourier transformation to the 
Hilbert space I/^(R, dp) of wave functions depending on the momentum p : 

T : L'^{R,dx) — > L'^{R,dp) (2.13) 

1 f i 
/ ' — ^ with (JF/) (p) = — == / dx f{x)exp{--px) . 

v2Trh -/n n 



2.2.2 Operators 



In this section, we will properly define the notion of an operator and its adjoint (the Her- 
mitian conjugate operator), as well as Hermitian operators and self-adjoint operators (i.e. 
physical obscrvablcs). Furthermore, we will indicate how the properties of the spectrum 
of a given operator are intimately connected with the mathematical characteristics of this 
operator and of its eigenf unctions. We strongly encourage the reader who is not familiar 
with the definitions and results spelled out at the beginning of this subsection to pursue 
the reading with the illustrations that follow. 
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The following considerations (i.e. sections 2.2.2 and 2.2.3) do not only apply to the 
concrete Hilbert space L'^(R,dx) of wave mechanics, but to any Hilbert space Ti. that is 
isomorphic to it, i.e. any complex Hilbert space which admits a countably infinite basisQ; 
therefore, we will refer to a general Hilbert space Ti. of this type and only specialize to the 
concrete realizations L'^(R, dx) or L^([0, 1], dx) in the examples. 

Just as a function / : R — R has a domain of definition C R, a Hilbert space 

operator A : Ti ^ Ti admits such a domain: 

Definition 1 An operator on the Hilbert space Ti is a linear map 

A : V{A) — > n (2.14) 

I — > Alp , 

where V^A) represents a dense linear subspace of TC. This subspace is called the domain 
of definition of A or, for short, the domain of A. (Thus, strictly speaking, a Hilbert space 
operator is a pair {A,V{A) ) consisting of a prescription of operation on the Hilbert space, 
together with a Hilbert space subset on which this operation is defined.) 

If B denotes another operator on H {with domain T>[B)), then the operator A is said to 
he equal to the operator B if both the prescription of operation and the domain of definition 
coincide, i.e. if 

Aif = Bif for all if G V{A) = V{B) . 
In this case, one writes A = B. 

We note that, in the mathematical literature, the definition of an operator is usually 
phrased in more general terms by dropping the assumptions that T>{A) is dense and that 
the map A is linear; however, these generalizations scarcely occur in quantum mechanics 
and therefore we will not consider them here. 

As emphasized in the previous definition, an operator is not simply a formal operating 
prescription and two operators which act in the same way are to be considered as different if 
they are not defined on the same subspace of Hilbert space. A typical example for the latter 
situation is given by a physical problem on a compact or semi-infinite interval: the domain 
of definition of operators then includes, in general, some boundary conditions whose choice 
depends on the experimental set-up which is being considered. Clearly, two non equivalent 
experimental set-ups for the measurement of a given physical observable generally lead to 
different experimental results: thus, it is important to consider as different two Hilbert 
space operators which act in the same way, but admit different domains of definition. 
Actually, most of the contradictions derived in section 2 can be traced back to the fact 
that the domains of definition of the operators under consideration have been ignored! 

By way of example, let us first consider three different operators on the Hilbert space 
Ti. = L'^(R, dx) and then another one on L'^{[0, 1]). 

l.a) The position operator Q for a particle on the real line is the operator 'multiplication 
by x' on L^(R, dx): 

{Q^){x) = xtpix) for all a; gR. (2.15) 
^The precise definition of an isomorphism is recalled in section 3.2 below. 
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The maximal domain of definition for Q is the one which ensures that the function Qxp 
exists and that it still belongs to the Hilbert space : 



= e L'^{R,dx) \ WQtpf = [ dxxV(a;)r < 00} • (2.16) 

JR 

Obviously, this represents a proper subspace of L'^(R,, dx) and it can be shown that it is a 
dense subspace [|], |14 . 

l.b) Analogously, the maximal domain of definition of the momentum operator P = — — 

i dx 

on the Hilbert space L^(R, rfx) i^ 

v^UP) = {^e l2(r, dx) I V' e l2(r, dx)} . 

1. c) For certain considerations it is convenient to have at one's disposal a domain of 
definition that is left invariant by the operator (rather than a maximally defined operator). 
For the operator Q, such a domain is given by the Schwartz space S (R) of rapidly decreasing 
functions. Let us recall that a function / : R — > C belongs to 5(R) if it is differentiable an 
infinite number of times and if this function, as well as all of its derivatives, decreases more 
rapidly at infinity than the inverse of any polynomial). This implies that 5(R) C 'r'max(Q) 
and 

Q : 5(R) — > 5(R) . (2.17) 

The Schwartz space also represents an invariant domain of definition for the momentum 

operator P = - on ^^(R, dx), i.e. P : S{K) S{R). 
i dx 

According to the definition given above, the operators P and Q introduced in the last 
example are different from P and Q as defined in the preceding two examples, respectively; 
however, in the present case where we have an infinite interval, this difference is rather 
a mathematical one since physical measurements do not allow us to make a distinction 
between the different domains. 

2. Let us also give an example concerning a particle which is confined to a compact 
interval e.g. [0, 1]. In this case, the wave function generally satisfies some boundary 
conditions which have to be taken into account by means of the domains of the operators 
under investigation. For instance, let us consider example 4 of section 2, i.e. the momentum 
operator P on 7i = i^^([0, 1], dx) with the usual boundary conditions of an infinite potential 
well: the domain of definition of P then reads 



V{P) = {ij eH\ij' e'Ha.ndi){<d)=Q = ^{l)} . (2.18) 

■^Since the integral involved in the definition of the space £^(R, dx) is the one of Lebesgue, one only needs 
to ensure that the considered functions behave correctly 'almost everywhere' with respect to Lebesgue's 
measure (see textbooks on analysis) : thus, ■0' G (R-i dx) means that the derivative ijj' exists almost 
everywhere and that it belongs to (R, dx) . 
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Shortly, we will come to the mathematical and physical properties of this quantum me- 
chanical operator. 

The specification of the domain plays a crucial role when introducing the adjoint At of 
a Hilbert space operator A: 

Definition 2 For an operator A on H, the domain of At is defined by 

= {(p&7i\3(f&7i such that 

{v,A^) = {^,ij)forallijeV{A)} . 

(Here, the vector (p depends on both A and ip.) For ip E 'D(A^), one defines A'^Lp = (p, i.e. 

{ip, Alp) = {A^^, V^) for all G V{A) . (2.19) 

By way of example, we again consider the Hilbert space n = L2([0, l],dx) and the 
momentum operator P with the domain ( 2.1^ ). According to the previous definition, the 

domain of 

pt 

is given by 

p(pt) = {ip en\3<p en such that ((^, Ptp) = ip) for all ^p G P(P)} 
and the operating prescription for pt is determined by the relation 

(y?, P^p) = (Pty;, ip) for all ip G V{P) . (2.20) 
The integration by parts ([^.3|) or, more precisely. 



1 



dx{JpP^-[^-^\^)ix) = ^ ip{l)ij{l) - ^{0)^(0) 
\i ax J 1 L 

= for all Ip G V{P) 

shows that the boundary conditions satisfied hy ip E T>{P) are already sufficient for anni- 
hilating the surface term and it shows that pt acts in the same way as P. Hence, 

pt = --^ , V{P^) = {y^en\y^' en} . (2.21) 
1 ax 

Thus, the domain of definition of pt is larger than the one of P: V{P) C I?(pt). 

In quantum theory, physical observables are described by Hilbert space operators which 
have the property of being self-adjoint. Although many physics textbooks use this term as 
a synonym for Hermitian, there exists a subtle difference between these two properties for 
operators acting on mfinite dimensional Hilbert spaces: as we will illustrate in the sequel, 
this difference is important for quantum physics. 
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Definition 3 The operator A on Ti. is Hermitian if 



{<^,A4j) = {AyD,^) for all<^,tlj eV{A) (2.22) 



I.e. 



A^(p = A(p for all ip G V{A) . 

(In other words, the operator A is Hermitian if A^ acts in the same way as A on all vectors 
belonging to T>(A), though A^ may actually be defined on a larger subspace than T)(A).) 

An operator A on Ti, is self-adjoint if the operators A and A^ coincide (A = A^ ), i.e. 
explicitly 

V{A) = V(A^) and A^^ = A^ for all ^ e V{A) . (2.23) 

Thus, any self-adjoint operator is Hermitian, but an Hermitian operator is not necessarily 
self-adjoint. Our previous example provides an illustration for the latter fact. We found 
that P and act in the same way, though the domain of definition of pt [as given 
by equation ( p.21j )) is strictly larger than the one of P (as given by equation ( p.l8|) ): 



V{P) C r'(pt), but V{P) ^ P(pt). From these facts, we conclude that P is Hermitian, 
but not self-adjoint: P ^ pt. 

One may wonder whether it is possible to characterize in another way the little "extra" 
that a Hermitian operator is lacking in order to be self-adjoint. This missing item is 
exhibited by the following result which is proven in mathematical textbooks. If the Hilbert 
space operator A is self-adjoint, then its spectrum is real H[|T^-[]T^ and the eigenvectors 



associated to different eigenvalues are mutually orthogonal; moreover, the eigenvectors 
together with the generalized eigenvectors yield a complete system of (generalized) vectors 
of the Hilbert spaceQ ^ ^. These results do not hold for operators which are only 



Hermitian. As we saw in section 2.1, this fact is confirmed by our previous example: 
the Hermitian operator P does not admit any proper or generalized eigenfunctions and 
therefore it is not self-adjoint (as we already deduced by referring directly to the definition 
of self-adjointness). 

Concerning these aspects, we recall that the existence of a complete system of (gen- 
eralized) eigenfunctions is fundamental for the physical interpretation of observables. It 
motivates the mathematical definition of an observable which is usually given in text- 
books on quantum mechanics: an observable is defined as a "Hermitian operator whose 



orthonormalized eigenvectors define a basis of Hilbert space" ^1|]. The shortcomings of 
this approach (as opposed to the identification of observables with self-adjoint operators) 
will be discussed in section 4. Here, we only note the following. For a given Hilbert 
space operator, it is usually easy to check whether it is Hermitian (e.g. by performing 
some integration by parts). And for a Hermitian operator, there exist simple criteria for 
self-adjointness: the standard one will be stated and applied in the appendix. 



^The vectors which satisfy the eigenvalue equation, though they do not belong to the Hilbert space H, 
but to a larger space containing Ti, are usually referred to as "generalized eigenvectors", see section 2.2.3 
below. 
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Finally, we come to the spectrum of operators for which we limit ourselves to the general 
ideas. By definition, the spectrum of a self- adjoint operator A on the Hilbert space Ti. is 
the union of two sets of real numbers, 

1. the so-called discrete or point spectrum, i.e. the set of eigenvalues of A (that is 
eigenvalues for which the eigenvectors belong to the domain of definition of A). 

2. the so-called continuous spectrum, i.e. the set of generalized eigenvalues of A (that 
is eigenvalues for which "the eigenvectors do not belong to the Hilbert space Ti"). 

These notions will be made more precise and illustrated in the next subsection as well as in 
the appendix. From the physical point of view, the spectral values of an observable (given 
by a self-adjoint operator) are the possible results that one can find upon measuring this 
physical quantity. The following two examples are familiar from physics. 

1. For a particle moving on a line, the observables of position and momentum can both 
take any real value. Thus, the corresponding operators have a purely continuous and 
unbounded spectrum: SpQ = R and SpP = R. A rigorous proof of this result will be 
given shortly (equations ( |2.26| )-( pl29| ) below). 

2. For a particle confined to the unit interval and subject to periodic boundary conditions, 
we have wave functions belonging to the Hilbert space L'^{[0,l],dx) and satisfying the 
boundary conditions ip{0) = In this case, the position operator admits a continuous 
and bounded spectrum given by the interval [0, 1]. The spectrum of the momentum op- 
erator is discrete and unbounded, which means that the momentum can only take certain 
discrete, though arbitrary large values. 

In order to avoid misunderstandings, we should emphasize that the definitions given in 
physics and mathematics textbooks, respectively, for the spectrum of an operator acting 
on an infinite dimensional Hilbert space, do not completely coincide. In fact, from a math- 
ematical point of view it is not natural to simply define the spectrum of a generic Hilbert 
space operator as the set of its proper and generalized eigenvalues (e.g. see references 
and P): by definition, it contains a third part, the so-called residual spectrum. The latter 
is empty for self-adjoint operators and therefore we did not mention it above. However, 
we will see in the appendix that the residual spectrum is not empty for operators which 
are only Hermitian; we will illustrate its usefulness for deciding whether a given Hermitian 
operator can possibly be modified so as to become self-adjoint (i.e. precisely the criteria 
for self-adjointness of a Hermitian operator that we already evoked). 

Before concluding our overview of Hilbert space operators, we note that all definitions 
given in this subsection also hold if the complex Hilbert space 7i is of finite dimension n, 
i.e. Ti. ^ C". In this case (which is of physical interest for the spin of a particle or for a 
quantum mechanical two-level system), the previous results simplify greatly: 

- any operator A on Ti. (which may now be expressed in terms of a complex n x n 
matrix) and its adjoint (the Hermitian conjugate matrix) are defined on the whole Hilbert 
space, i.e. T>{A) = H 

- Hermitian and self-adjoint are now synonymous 
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- the spectrum of A is simply the set of all its eigenvalues, i.e. we have a purely discrete 
spectrum. 

Yet, it is plausible that the passage to infinite dimensions, i.e. the addition of an infinite 
number of orthogonal directions to C" opens up new possibilities! As S.MacLane put it: 
"Gentlemen: there's lots of room left in Hilbert space." [jl4| 



2.2.3 Observables and generalized eigenfunctions 

When introducing the observables of position and momentum for a particle on the real 
line, we already noticed that they are not defined on the entire Hilbert space, but only on 
a proper subspace of it. In this section, we will point out and discuss some generic features 
of quantum mechanical observables. 

Two technical complications appear in the study of an observable A in quantum me- 
chanics: 

(i) If the spectrum of A is not bounded, then the domain of definition of A cannot be all 

of n. 

(ii) If the spectrum of A contains a continuous part, then the corresponding eigenvectors 
do not belong to H, but rather to a larger space. 

Let us discuss these two problems in turn before concluding with some related remarks. 



(i) Unbounded operators 

The simplest class of operators is the one of bounded operators, i.e. for every vector 
"0 e 1^{A), one has 

II^VII < c ||'?/'|| where c > is a constant . (2.24) 

This condition amounts to say that the spectrum of A is bounded. Bounded operators 
can always be defined on the entire Hilbert space, i.e. T>{A) = Ti. An important example 
is the one of an unitary operator U : H ^ H ; such an operator is bounded, because 
it is norm-preserving (i.e. WUipW = for all ijj & Ti) and therefore condition ( |2.24D is 
satisfied. (The spectrum of U lies on the unit circle of the complex plane and therefore 
it is bounded. For instance, the spectrum of the Fourier transform operator J-' defined by 
(|27r3| ) is the discrete set {±1, ±i} ^ g.) 



A large part of the mathematical subtleties of quantum mechanics originates from the 
following result p|, p 



Theorem 1 (Hellinger-Toeplitz) Let A be an operator on Ti which is everywhere de- 
fined and which satisfies the Hermiticity condition 

{^,A^) = {A<f,^) (2.25) 

for all vectors ^Ti. Then A is bounded. 
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In quantum theory, one often deals with operators, hke those associated to the position, 
momentum or energy, which fulfill the Hermiticity condition ( p.25D on their domain of 



definition, but for which the spectrum is not bounded. (In fact, the basic structural relation 
of quantum mechanics, i.e. the canonical commutation relation, even imposes that some 
of the fundamental operators, which are involved in it, are unbounded - see appendix.) 
The preceding theorem indicates that it is not possible to define these Hermitian operators 
on the entire Hilbert space 7i and that their domain of definition necessarily represents 
a proper subspace of 7i. Among all the choices of subspace which are possible from the 
mathematical point of view, certain ones are usually privileged by physical considerations 
(boundary conditions, ...) ^ [gi, ^, g. 



By way of example, we consider the position operator Q defined on the Schwartz space 
5(R), see eqs.( |2.15| ) ( |2.17| ). This operator is Hermitian since all vectors (p,ip E i5(R) satisfy 



{ip^Qtp) = / dxLpxil)= / dx x(f = {Q(f,-ip) ■ 

As we are going to prove shortly, the spectrum of this operator is the entire real axis 
(refiecting the fact that Q is not bounded). For this specific operator, we already noticed 
explicitly that there is no way to define Q on all vectors of Hilbert space: at best, it can 
be defined on its maximal domain (|2.16|) which represents a nontrivial subspace of Hilbert 
space. 

(ii) Generalized eigenfunctions (Gelfand triplets) 

The position operator Q defined on S(R) also illustrates the fact that the eigenvectors 
associated to the continuous spectrum of a self-adjoint operator do not belong to the 
Hilbert spaceQ. In fact, the eigenfunction ip^o associated to the eigenvalue G R is 
defined by the relation 

(Q^^J (x) = xo tpxoix) (xo G R , G 5(R) , V^o ^ 0) (2.26) 

or else, following (|2.15| ), by 

(x — Xo) "ipxoix) =0 for all X G R . 

This condition implies ipxoi^) = for x 7^ xq. Consequently, the function ipxo vanishes 
almost every whereQ and thus represents the null vector of L'^(R,, dx) |T3|, Hence, the 



operator Q does not admit any eigenvalue: its discrete spectrum is empty. 

We note that the situation is the same for the operator P defined on i5(R) which is 
also essentially self-adjoint: the eigenvalue equation 

(Pijp) (x) = pijpix) (pgr, G 5(R) , ^ 0), , 



^To be precise, the operator Q defined on 5(R) is essentially self-adjoint which impUes that it can be 
rendered self-adjoint in a unique manner by enlarging its domain of definition in a natural way (see § 
for details). 

® Actually, ip^o S '^(R) implies that "i/j^o is continuous and therefore it follows that ipxo vanishes every- 
where. 



16 



is solved by tppix) = l/VSvr/i exp{ipx/fi), but ipp ^ 5(R). Thus P does not admit any 
eigenvalue. 

On the other hand, the eigenvalue equations for Q and P admit weak (distributional) 
solutions. For instance, Dirac's generalized function (distribution) with support in xq, i.e. 
Sxo{x) = S{x — Xq), is a weak solution of the eigenvalue equation (|2.26| ): in order to check 
that x6xo{x) = xoSxo{x) in the sense of distributions, we have to smear out this relation 
with a test function ip E 5(R): 



dx X (x) Lp{x) = xo ^(xo) = dxxo 4o (x) Lp{x) 



(2.27) 

In. " Jn 

Dirac's generalized function and the generalized function x6x^ do not belong to the domain 
of definition S(R) of Q, rather they belong to its dual space 

iS'(R) = {uj : 5(R) C linear and continuous} , 

^ p!9||. They are defined in an 



i.e. the space of tempered distributions on R [|I4 
abstract and rigorous manner by 



5(R) 



C 



(2.28) 



and 



X 6. 



xo 



S(R) 



[xSxo) (v^) = ^xoiw) xqlp{xq) 



With these definitions, the formal writing ( p. 27] ) takes the precise form 
{x5xo){v) = {xq5xo){'^) for all G 5(R) . 

Thus, the eigenvalue equation Qipx^ = xq ipxo admits a distributional solution ip^g for every 
value Xo G R. Since the spectrum of the (essentially self-adjoint) operator Q is the set 
of all real numbers for which the eigenvalue equation admits as solution either a function 
ip G T^{Q) = (discrete spectrum) or a generalized function %p G iS'(R) (continuous 

spectrum), we can conclude that Sp Q = R and that the spectrum of Q is purely continuous. 

Analogously, the function ijjp{x) = l/^/2Tch exp{ipx/h) defines a distribution Ip accord- 
ing to 



5(R) 



c 

lp{<p) 



(2.29) 



R 



dx ipp^x) ip{x) ^'^=^ {^'^){p) 



where jFyj denotes the Fourier transform ( p. 13 ). The distribution Ip represents a solution 
of the eigenvalue equation Pip = pip since the calculational rules for distributions and 
Fourier transforms P, |l^] and the definition (|2.29| ) imply that 



(Pip) if) 
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It follows that SpP = R (purely continuous spectrum). 

The eigenvalue problem for the operators P and Q which admit a continuous spectrum 
thus leads us to consider the following Gelfand triplet ( "rigged or equipped Hilbert space")^ 



5(R) C L\R,dx) C 5'(R) . (2.30) 



Here, iS(R) is a dense subspace of i^^(R, dx) and every function ip G L'^(R, dx) defines 
a distribution uj^ G iS'(R) according to 

oj^ : 5(R) — > C 



I — > iu^{(p) = / dx^jj{x)(p{x) . (2.31) 

However ^X-f^) ^Iso contains distributions like Dirac's distribution or the distribution Ip 
which cannot be represented by means of a function ip G L'^(R,, dx) according to ( |2.31| ). The 
procedure of smearing out with a test function (f G S(R) corresponds to the formation of 
wave packets and the theory of distributions gives a quite precise meaning to this procedure 
as well as to the generalized functions that it involves. 

The abstract definition of the triplet (|2.30|) can be made more precise (e.g. specification 
of the topology on iS(R),...) and, furthermore, iS(R) can be generalized to other subspaces 
of L'^(R,,dx) (associated to Q or to other operators defined on L'^(R,dx)). Up to these 
details (which are important and which have to be taken into account in the study of a 
given Hilbert space operator), we can say that 

the triplet ( gT30D describ es in an exact and simple manner the math- 
ematical nature of all kets and bras used in quantum mechanics.^ 

In fact, according to the Riesz lemma (which is recalled in section 3.1.1 below), the Hilbert 
space L^(R, dx) is isometric to its dual: thus, to each ket belonging to L^(R, rfx), there 
corresponds a bra given by an element of L'^(R,dx), and conversely. Moreover, a ket 
belonging to the subspace 5(R) always defines a bra belonging to 5'(R) by virtue of 
definition ( |2.31| ). But there exist elements of iS'(R), the generalized bras, to which one 
cannot associate a ket belonging to iS(R) or L^(R, dx). We note that the transparency of 
this mathematical result is lost if one proceeds as one usually does in quantum mechanics, 
namely if one describes the action of a distribution on a test function (p in a purely formal 
manner as a scalar product between (f G S(R) C L'^(R,, dx) and a function which does not 
belong to L^(R, dx): 

W) = (^., ^>,, . / d.US)^f(x) ( iguoriug that i L^(R, dx) ) 

^xoi^) = {^xq,^)l2= / dx6xo{x)v{x) (ignoring that ^ L^(R,c/x) ). (2.32) 
Jr. 

^Gelfand triplets are discussed in detail in the textbook (see also |^ and |^ for a slightly modified 
definition). A short and excellent introduction to the definitions and applications in quantum mechanics 
is given in references |2|, |2^, |j. Concerning the importance of Gelfand triplets, we cite their inventors 
|T9[ : "We believe that this concept is no less (if indeed not more) important than that of a Hilbert space." 

^The reader who is not acquainted with bras and kets may skip this paragraph and come back to it 
after browsing through section 3. 
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Quite generally, let us consider a self-adjoint operator A on the Hilbert space H. The 
eigenfunctions associated to elements of the continuous spectrum of A do not belong to 
the Hilbert space TC: one has to equip H with an appropriate dense subspace Q and its 
dual Q' which contains the generalized eigenvectors of A, 

The choice of the subspace Q is intimately connected with the domain of definition of 
the operator A one wants to study^. While the introduction of the space Q is mandatory 
for having a well-posed mathematical problem, the one of Q' is quite convenient, though 
not indispensable for the determination of the spectrum of A. In fact, there exist several 
characterizations of the spectrum which do not call for an extension of the Hilbert spacep^ 
Let us mention three examples. The different parts of the spectrum of A can be described 
by different properties of the resolvent Ra{z) = {A — zl)~^ (where z G C) |T3[ or 
(in the case where A is self-adjoint) by the properties of the spectral projectors Ea{X) 
(where A G R) associated to A |^ ^, or else by replacing the notion of distributional 
eigenfunction of A by the one of approximate eigenfunction [|15|. (The latter approach 
reflects the well-known fact that distributions like S^^i can be approximated arbitrarily well 
by ordinary, continuous functions.) 

(ill) Concluding remarks 

As we already indicated, the examples discussed in the previous subsection exhibit both 
the problems raised by an unbounded spectrum and by a continuous part of the spectrum. 
We should emphasize that these problems are not related to each other; this is illustrated 
by the position and momentum operators for a particle confined to a compact interval and 
described by a wave function with periodic boundary conditions (see example 2 at the end 
of section 2.2.2). 

We note that it is sometimes convenient to pass over from unbounded to bounded 
operators. One way to do so consists of using exponentiation to pass from self-adjoint to 
unitary operators (which are necessarily bounded). For instance, if Q and P denote the 
(essentially) self-adjoint operators of position and momentum with the common domain 
of definition 5(R), then Ua = exp(— ^aQ) and H = exp (— -^feP) (with a, 6 G R) define 
one-parameter families of unitary operators. In particular, Vb represents the translation 
operator for wave functions: 

(Vbij) (x) = ip{x - b) for G L^{K, dx) . 

^Generally speaking, one has to choose the space in a maximal way so as to ensure that U,' is as 
"near" as possible to Ti : this provides the closest possible analogy with the finite-dimensional case where 
= 7Y = il' [^0[ In fact, if one chooses Vt large enough, the space Vl' is so small that the generalized 
eigenvectors of A which belong to Vt' "properly" characterize this operator (see references [|3[ for a 
detailed discussion). 

^°In this context, let us cite the authors of reference "We only recommend the abstract rigged 

space approach to readers with a strong emotional attachment to the Dirac formalism." This somewhat 
provocative statement reflects fairly well the approach followed in the majority of textbooks on functional 
analysis. 
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This bounded operator admits a continuous spectrum consisting of the unit circle. If 

h 

written in terms of Ua and VJ,, the canonical commutation relation (CCR) [P, Q] = —1 
takes the so-called Weyl form 

UaVb = e-^'^'VbUa . (2.33) 

This relation can be discussed without worrying about domains of definition since it only 

involves bounded operators. It was used by J. von Neumann to prove his famous uniquess 

theorem for the representations of the CCR: this result states that the Schrodinger repre- 

h d 

sentation 'Q = multiplication by x', P = —— is essentially the only possible realization 

i dx 

of the CCR H ig]. 
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Chapter 3 

Quantum mechanics and Hilbert 
spaces 



The different representations used in quantum mechanics are discussed in numerous mono- 
graphs |21| and we will summarize them in the next subsection while evoking the his- 
torical development. The underlying mathematical theory is expounded in textbooks on 
functional analysis |T3]. Among these books, there are some excellent monographs which 



present the general theory together with its applications to quantum mechanics |T4, H, 15 



(see also ||^, |TB|, 0). The basic result which is of interest to us here (and on which 
we will elaborate in subsection 3.2) is the fact that all representations mentioned above 
are equivalent. This implies that the problems we presented in section 2 are not simply 
artefacts of wave mechanics and that such problems manifest themselves in quantum the- 
ory whatever formulation is chosen to describe the theory. In fact, we could have written 
some of the equations explicitly in terms of operators P, Q, L^, acting on an abstract 
Hilbert space and in terms of vectors |m), ... belonging to such a space, e.g. eqs. (|2.1| ) and 



3.1 The different Hilbert spaces 

As in section 2.2, we consider the motion of a particle on a straight line parametrized by 
X G R. There are essentially three Hilbert spaces which are used for the description of the 
states of this particle. 

(1) Wave mechanics: 

Wave mechanics was developed in 1926 by E. Schrodinger "during a late erotic outburst 
in his life" p9| , |30[| . There were two sources of inspiration. The first was the concept of 



matter waves which L. de Broglie had introduced in his thesis a few years earlier (1923). The 
second was given by the papers on statistical gas theory that Einstein had published in 1925 
with the aim of extending some earlier work of S.N. Bose (which papers led to the famous 
"Bose-Einstein statistics" for indistinguishable particles): Einstein believed that molecules 
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as well as photons must have both particle and wave characters [^. E. Schrodinger 
laid the foundations of wave mechanics in three remarkable papers that he wrote in early 
1926, just after returning from the studious Christmas holidays that he had spent with 



an unknown companion in the ski resort of Arosa Looked upon in retrospective and 
from the mathematical point of view, he considered the Hilbert space L'^(R, dx) of square 
integrable functions (i.e. wave functions) with the scalar product ( p.l2| ) (or, equivalently, 
the space I/^(R, dp) of wave functions depending on the momentum p, i.e. the so-called p- 
representation) . It is worthwhile recalling that any element ip G L'^(R,, dx) can be expanded 
with respect to a given orthonormal basis {v^njneN of L'^(R,dx): 

oo 

^ = H C"*^" ^it'^ = (V'n,^)L2 G C . (3.1) 
n=0 

If ip' = J2n c'nfn dcuotcs another element of L'^(R,, dx), its scalar product with ip takes the 
form 

oo 

(^,^')L2 = ECn< . (3.2) 
n=0 



In particular, for the norm = {ip, ip) of ip, we have 

oo 

llV'ir = E \cn\' . (3.3) 

n=0 

(2) Matrix mechanics: 

Once an orthonormal basis {v^njneN of i^^(R, dx) has been chosen, the expansion ( |3.1| ) 
of ip & L'^(R,, dx) with respect to this basis expresses a one-to-one correspondence between 
the wave function ip and the infinite sequence c = (cq, ci, ...) of complex numbers. By virtue 
of relation ( p.3|) , the square integrability of the function ip (i.e. W'lpW'^ = /r \4'{x)\'^dx < oo) 
is equivalent to the square summability of the sequence c (i.e. J2n < oo). 

Thus, quantum mechanics can also be formulated using the Hilbert space I2 of square 
summable sequences, 

00 

h = {c= {co,ci, ...) \ Cn e C and E l^nP < 00} , 

n=0 

with the scalar product 

00 

(c, c' = ^c^c'^ for all c, c' e h ■ 

n=0 

The linear operators on I2 are then given by square matrices of infinite order. This formu- 
lation of quantum theory is known as matrix mechanics and was actually the first one to 



be discovered 29, 31, BO 



- In 1925, Heisenberg wrote his pioneering paper pointing in the right direction. 
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- A first comprehensive account of tlie foundations of matrix mechanics was given in 
November 1925 in the famous "Dreimannerarbeit" (three men's work) of Born, Heisenberg 
and Jordan. 

- Some of these results have been put forward independently by Dirac who introduced 
the concept of commutator of operators and identified it as the quantum analogon of the 



classical Poisson brackets 32 



- Pauli's subsequent derivation of the spectrum of the hydrogen atom using matrix 
methods provided strong evidence for the correctness of this approach. 

Despite this success, matrix mechanics was still lacking clearly stated physical princi- 
ples and therefore only represented high mathematical technology. The situation changed 
in 1926 after Schrodinger proved the equivalence between wave mechanics and matrix me- 
chanics, which is based on the correspondence 



wave function 1 i-i j sequence 



< — > 



-0 e L^{R,dx) j [eel 



2 



(3.4) 



This result led Born to propose his probabilistic interpretation of wave functions which 
finally provided the physical principles of the theory. The equivalence ( p.4|) also represented 
the starting point for the search of an "invariant" version of quantum mechanics. Through 
the work of Dirac and Jordan, this undertaking led to the study of linear operators acting 
on an abstract Hilbert space 

It is a remarkable coincidence that the monograph of Courant and Hilbert |^ devel- 



oping the mathematics of Hilbert space was published in 1924 and that it appeared to be 
written specifically for the physicists of this timeQ. Incidentally, the space I2 has been intro- 
duced in 1912 by D. Hilbert in his work on integral equations, but an axiomatic definition 
of Hilbert space was only given in 1927 by J. von Neumann in a paper on the mathematical 
foundation of quantum mechanics [|18] . In the sequel, the theory of Hilbert space operators 
was further refined, mainly through the contributions of von Neumann [^, Schwartz p5 



and Gelfand [0. Thanks to these refinements, it allows for a precise description of all 
states and observables in quantum mechanics. To conclude our historical excursion on the 
mathematical formulation of quantum theory, we recall the remarkable fact that some of its 
main protagonists (namely Heisenberg, Dirac, Jordan, Pauli and von Neumann) achieved 
their ground-breaking contributions to physics and mathematics around the age of 23. 

(3) The invariant formalism: [Dirac, Jordan, von Neumann, 1926 - 1931] 

One uses an abstract complex Hilbert space Ti that is separable (which means that it 
admits an orthonormal basis consisting of a denumerable family of vectors) and infinite 
dimensional. 

It was in this framework that Paul Adrien Maurice Dirac invented the astute bra and 
ket notation, following a particular interpretation of the expressions involving vectors and 



""^D. Hilbert: "I developed my theory of infinitely many variables from purely mathematical interests 
and even called it 'spectral analysis' without any pressentiment that it would later find an application to 
the actual spectrum of physics." l36[ 
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operators^]. Before discussing this interpretation and the resuhing advantages and short- 
comings (section 4.2), we first summarize Dirac's notation together with some fundamental 
concepts of the theory of Hilbert spaces ||, |8|1 



3.1.1 Dirac's notation 

A vector l^p) g is called a ket and to this vector we can associate a linear form ui^,) = (\E'| 
called a bra and defined by means of the scalar product {"bracket"): 

cj|^) = (^1 : n ^ C (3.5) 

1$) ^ a7|^)(|$)) = (|^), = (^1$) . 

According to the Cauchy-Schwarz inequality relating the scalar product and the norm 



|(vl/|$)|<||vl/||.||$|| , 

the linear form ti;|,i,) is continuous on Ti: this means that for every |$) € 7i there exists a 
constant c > such that |cij|^)(|$))| < c||$||. Consequently, the bra (^^l is an element of 
the dual Hilbert space 

Ti* = {uj : Ti. — > C linear and continuous} . 

Conversely, to each bra (\1'| G 7i* we can associate a ket |\E') G 7i; in fact, by virtue of 
the Riesz lemma, every element uj G Ti* uniquely determines a vector |\1'(^) G Ti. such that 

cj(|$)) = ($^|$) for all 1$) G 7^ . 

(The vector |\E'^) "realizes" the map u by means of the scalar product.) The vector associ- 
ated to the linear form (^'l is denoted by |\1') and thus we have a one-to-one correspondence 
between |^) G 7^ and (^| G 7^*: 



H 9 1^) ^ (^1 G n* . (3.6) 



Hence we can identify^ Ti and Ti* and completely do without Ti*. The introduction of a 
dual vector space is only necessary for defining generalized vectors, as we saw in section 
2.2.3. 



^As pointed out by H.Kragh a peculiar feature of Dirac's physics was his interest in notation and 
his readiness to invent new terms and symbols. Apart from the bra and ket notation, the (5-function and 
the commutator of operators that he introduced, we should mention that he coined the terms c-number, 
fermion and boson and that he defined ?i as a short form for h/2'K. 

■^If one defines the norm of w G Ti* by |jw|| = sup |ti'(/)| (where the supremum is taken over all unit 
vectors / € TL), then one can show that the bijection Ti Ti.* is antilinear and that it is norm- preserving, 
i.e. it represents an isometry. 
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An orthonormal basis {|$n) = l^)}nGN of H is a set of vectors satisfying the orthonor- 
malization relation 

{nlm) = 6nm for all n, m G N (3-7) 



and the closure relation 



Y,\n){n\ = ln • (3.8) 



n=0 



Applying the latter to any vector |^) G 7i, we find the expansion 

oo 

1^) = E , 



n=0 

that we encountered already for wave functions, cf. eq. (|3.1|) . Note that the relation ( |3.8| ) 
involves the sum of the operators which are obtained by composing two maps: 

n ^ C ^ n (3.9) 

1$) I — > {n\^) I — > (n|<l>) \n) . 



Here, the first map is the linear form (|3.5| ) and the second represents the multiplication of 
a complex number by the vector \n) G 7i. 

As stated in section 2.2.2, an operator on ?i is a linear map 

A : V{A) — > n (3.10) 
1^) I — ^ , 

where T>{A) is a dense linear subspace of Ti. The scalar product of the vectors 1$) and 
^l^') is denoted following Dirac by 

{\<l>),A\^))n = mA\^) . (3.11) 

Thus, the expression may be considered as the result of the composition of two 

linear maps, 

V{A) ^ n ^ C (3.12) 

where the composition is defined as usual by (($| o A) |\E') := ($| (A|^)). However, as 
we will discuss further in section 4.2.1, Dirac did not restrict himself to this unambiguous 
interpretation of the notation that he introduced. 

3.2 Relations between the Hilbert spaces 

In order to describe the relations between the Hilbert spaces introduced in the previous 



section, we need the concepts of unitary operator and of isomorphism [O]. By way of 
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motivation, we recall that equation p.4| ) expresses a one-to-one correspondence between 
the Hilbert spaces of wave mechanics and of matrix mechanics, and that equation ( |3.2| ) 
implies that this correspondence preserves scalar products, i.e. it is realized by a unitary 
operator: this is an example of an isomorphism of Hilbert spaces. 

Definition 4 For i = 1,2, let Tii he a complex separable Hilbert space with scalar product 
( , )7^^. A linear operator U : Hi ^ 0.2 is called unitary if 

(i) U is everywhere defined on Tii . 

(ii) The image ofTii under U is all of 7i2. 

(iii) U preserves the scalar product : 

{Uf,Ug)n, = {f,g)m for all f^geH^ . (3.13) 

Two Hilbert spaces Tii and 7^2 which are related by an unitary operator are said to be 
isomorphic and one writes Tii ^ 7^2- 

To summarize, we can say that two isomorphic Hilbert spaces represent different re- 
alizations of the same abstract structure and that they can be considered as completely 
equivalent. 

Concerning the Hilbert spaces occurring in quantum mechanics, we have at our disposal 
a classic result of functional analysis : 

Theorem 2 (i) The complex Hilbert spaces I2, L^(R,dx) and L^(R,dp) are separable and 
infinite dimensional. 

(ii) Every complex Hilbert space which is separable and infinite dimensional is isomorphic 
to k- 

From this result it follows that all Hilbert spaces introduced above are isomorphic : 



n-k^ L\R, dx) - L\R, dp) . (3.14) 



In particular, the Parseval-Plancherel theorem states that the Fourier transformation ( p.l3| ) 
realizes the isomorphism between -L^(R, dx) and i^^(R, dp) : 

(^/, J'g) = (/, g) for all f,ge L\R, dx) . 

Quite generally, the passage between Ti and the other Hilbert spaces is performed by 
choosing an orthonormal basis {|n)}„eN (or a generalized basis {|a:;)}a;gR,) of Ti and by 
associating to each vector of |\E') G ?i the set of its components with respect to this basis: 

i)n ■■= {n\m) fornGN , = (^0, V'l, •••) ^ /2 
i){x) := for xGR , ^ e L'^{Ii,dx) . (3.15) 

In the second line, the action of (x| on |\E') is to be understood in the sense of the action 
of a distribution on a test vector - see equations (|2.28| ) (|2.32| ). (Thus, strictly speaking, |^) 
has to belong to an appropriate test vector subspace f2 C 7Y.) Schematically: 
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en 




The passage between L'^(R,dx) and I2 is realized in an analogous manner and was 
already described above: to the function ip G L^(R, cix), one associates the sequence 
(co,Ci,...) G h consisting of the components c„ := {ipn,4')L^ of ip with respect to an 
orthonormal basis {v?n}neN of L^(R, rfx). If the wave functions ifn are associated to the 
state vectors \n) G H, i.e. v'„(x) = {x\n), then consistency with the notation ( ^.15| ) requires 
that Cn = ipn, i-e. = (^|^)- 
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Chapter 4 

Discussion of the invariant formalism 
and of Dirac's notation 

In this section, we try to disentangle two matters which are generally presented in an 
interwoven way, namely the choice of an abstract Hilbert space for formulating quantum 
mechanics and the use of the bra and ket notation. By discussing the advantages and 
disadvantages of each of these choices, we aim to highlight their positive points, which one 
should try to preserve in practice, and to draw attention to the pitfalls which are to be 
avoided. 



4.1 The invariant formalism 

Since the different Hilbert spaces used in quantum mechanics are all isomorphic, they 
represent completely equivalent mathematical structures. However, from the practical 
point of view, certain spaces are more appropriate than others^. 

1. The matrix calculus based on the space I2 is not easy to handle and this formalism 
has barely been used after the advent of quantum mechanics for which it played an 
important role ^ . 



The arena of physical phenomena is the so-called configuration space parametrized by 
X, and boundary or regularity conditions directly concern the wave functions defined 
on this space: this privileges the use of the Hilbert space i^^(R, dx). 

The choice of an abstract Hilbert space Ti is usually motivated by the analogy with 
geometry in Euclidean space R" (or C"): the use of "abstract vectors" is more 
geometrical than the one of their components. Thus, it is tempting to work with 
the vectors |^) G 7i while interpreting the sequences belonging to I2 or the functions 
belonging to L^(R, dx) as the components of the vectors |\E') with respect to different 
bases of Ti. In this spirit, the use of an abstract Hilbert space in quantum mechanics 



"^G. Orwell: "All animals are equal, but some animals are more equal than others." 
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is often presented as something more general than wave or matrix mechanics 
However, there are crucial differences between finite and infinite dimensional vector 
spaces which render the analogy with ordinary geometry quite subtle and doubtful. 
In the following, we will discuss the resulting problems which show that the choice of 
an abstract Hilbert space in quantum mechanics obscures or complicates important 
points of the theory. 

4.1.1 Problems 

• For the study of simple problems like the determination of the energy spectrum of the 
harmonic oscillator (for which the eigenfunctions of the Hamiltonian are well defined 
elements of L'^(R, dx)), one has to start with the introduction of the eigendistributions 
\x) of the position operator which do not belong to the Hilbert space Ti. (section 2.2.3). 

• As we emphasized in section 2.2.2 and illustrated in the appendix, the definition of a 
linear operator on an infinite dimensional Hilbert space necessitates the specification 
of an operating prescription and of a domain of definition for this operation. This 
aspect does not simply represent a mathematical subtlety, since the spectrum of 
the operator is quite sensitive to the domain of definition (boundary conditions,...). 
For instance, depending on the choice of domain, the spectrum of the momentum 

operator — — on a compact interval [a,b] C R can be empty, all of C or a subset 

of R (see appendix and reference ||^). While this problem is well posed, from the 
beginning on, for wave functions defined on configuration space, it is not to the same 
extent for an abstract Hilbert space. 

This problematics also appears in quantum statistical mechanics, e.g. the definition 
of the pressure associated to a set of particles confined to a box involves the boundary 



conditions ^8 



Let us come back to the mathematical characterization of observables which was 
already touched upon in section 2.2.2. 

In the invariant formalism of quantum mechanics, an observable is defined as an "Her- 
mitian operator whose orthonormalized eigenvectors define a basis of Hilbert space" 



2T| . Starting from this definition, it can then be shown, in a formal manner, that 
the position and momentum operators on R are observables. Notable complications 
already occur in R^ or R'^ if non-Cartesian coordinates are considered, e.g. the radial 
component Pr of momentum in is Hermitian, but it does not represent an observ- 
able - see Messiah chap. 9. And there are distinctly more complicated operators 
like Hamiltonians involving random potentials, potentials of the form or {x) 
or else Hamiltonians on topologically nontrivial configuration spaces like those for 



the Aharonov-Bohm effect or for anyons |14, 39, 23 1. The definition of an observ- 



able given above then requires to impose ad hoc conditions on the wave functions 
associated to eigenstates (conditions of regularity, finiteness, single- valuedness,...); 
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furthermore, it necessitates the exphcit determination of an orthonormal system of 
eigenvectors and the verification of the closure relation for this system. 

In an approach which takes into account the domains of definition, an observable is 
simply given by a self-adjoint operator (section 2.2.2). This condition ensures that 
the spectrum of the operator is real and that its (generalized) eigenvectors define a 
(generalized) basis of Hilbert space ("Hilbert's spectral theorem"). Moreover, there 
exist simple criteria for checking whether a given Hermitian operator is self-adjoint, 
or for classifying the different manners according to which it can be rendered self- 



adjoint - see 0, D, ^ and appendix. (In general, if an operator admits several 



self-adjoint extensions, the latter describe different physical situations |T^.) In 
particular, it is not necessary to resort to some ad hoc properties of wave functions 
like those mentioned above or to try to determine a complete system of orthonormal 
eigenvectors. The relevance of such a simple and precise approach also comes to light 
in perturbation |2^, ^ or scattering theory ^ . 



An important concept of quantum mechanics is the one of CSCO (complete system 
of commuting observables) . It involves the commutativity of self-adjoint operators 
which represents a subtle notion for unbounded operators. In fact, two self-adjoint 
operators A and B commute if and only if all projection operators occurring in their 
respective spectral decompositions commute [H. Unfortunately, counterexamples 



show that, in order to have the commutativity of A and B, it is not enough that 
[A, i?] = on a dense subspace of 7i on which this relation is well defined [Q! 
Admittedly, these examples ever scarcely appear in practice, but, in an approach 
which takes into account the domains of definition, one has at ones disposal all the 
tools that have to be called for, if a mathematical complication manifests itself. 



Concerning the raised mathematical points, we emphasize that in quantum mechanics a 
precise formulation is not only required for deciding about the existence or non-existence 
of physical effects (like the Aharonov-Bohm effect |l[]), but also for discussing the difficult 
interpretational problems (measurement theory, objectivity and reality,...) [Q. Besides, 
such a formulation directly applies to other fields of physics, one example being chaos in 
classical dynamical systems [|3 . 



4.1.2 "Solution" of problems 

Certain of the problems pointed out in the previous subsection are so involved that it seems 
more advisable to avoid them, rather than to look for a remedy. The complications mainly 
arise from the fact that - for conceptual reasons - one wishes to put forward the geometric 
Hilbert space structure which is underlying the theory. But this structure is also implicit 
in wave mechanics where the raised problems are absent or, at least, well posed from the 
beginning on. Thus, it is easy to avoid mathematical troubles or at least to render them 
more transparent by working with wave mechanics. 
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However, one cannot ignore that the abstract Hilbert space formahsm is so widely used 
in the physics hterature that all physicists need to be acquainted with it, though, at the 
same time, they should be able to resolve mathematical puzzles like those presented in 
section 2; thus, for the teaching of quantum mechanics, an obvious "solution" of problems 
is to start with an introduction to wave mechanics which emphasizes the underlying geo- 
metric structures and to indicate the arbitrariness of this formulation by passing over to 
other representations like matrix mechanics. (Such an approach to quantum theory and 
Schrodinger operators is supported by a vast and easily accessible mathematical htera- 
ture.) In modifying the explicit definition of the Hilbert space and its scalar product, the 
formalism of wave mechanics on R can then be generalized straightforwardly to several 
spatial dimensions, to the spin or to systems of particles. A subsequent discussion of the 
abstract Hilbert space Ti. and its relation to L^(R, dx) then paves the way to the literature 
and to formal calculations. 



4.2 Dirac's notation 

As we mentioned already, Dirac's bra and ket formalism consists, on one hand, of a certain 
writing of vectors, linear forms, ... and, on the other hand, of a particular interpretation 
of the mathematical operations which involve these entities. 



4.2.1 Inconveniences 

This writing, or rather its interpretation, presents a certain number of drawbacks which 
are more or less embarrassing. Among these, the most troublesome is the fact that it is 
impossible to give a precise meaning to the adjoint (of an unbounded operator A) if one 
strictly adheres to Dirac's interpretation (see ^ and also f^). Concerning this aspect, let 
us recall Dirac's fundamental definition (e.g. see equations (B.45) and (B.51) of chapter II 
of Cohen- Tannoudji et al. pTl): 

(($|A) 1^) = ($1 (A|^)) = with {^\A={A^^\ . (4.1) 

According to these relations, one cannot tell whether the expression ($|y4|\E') is to be 
interpreted as 

\E') (in which case 1$) G ©(^t) and \-^) en) 



or as 

{(^\A^) (in which case |^) G I'(A) and |<I>)g7^), 

unless one reintroduces the parentheses (which obviously takes away the simplicity and 
elegance of the calculus). Alas |T^, it is possible that T)(A^) = {0} for an operator A which 



is defined on a dense subspace of H. Even though this case scarcely ever appears in practice, 
the examples 3 and 7 of section 2 show that the ignorance of domains of definition can 
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readily lead to contradictions and incorrect results; accordingly, the correct treatment of a 
problem involving operators which cannot be defined everywhere (unbounded operators) 
is subtle. 

If one agrees upon the assumption that ($|y4|\E') is to be interpreted as ($| (y4|\E')), so 
that {^\A does not stand for but simply for a composition of operators (according 

to equation ( |3.12|) ), then the mathematical ambiguities concerning matrix elements are 
discarded. Yet, some inconveniences remain: we will discuss these in the familiar case 
where one strictly applies the bra and ket notation in an abstract, infinite dimensional 
Hilbert space H. 

• Rigid notation: Let us first recall the standard definition of the adjoint of a linear 
operatoifl A : H ^ H : 

(1$), v4|^))^ = (At|$), for all |$),|^) . (4.2) 

If one rigidly adheres to Dirac's notation, the expression on the right-hand side has to 
be rewritten using the skew-symmetry of the scalar product, ($|\1') = (\E'|$)*; thus, 
relation ( |4.2| ) which defines the adjoint of A becomes 

= (^|At|$)* for all 1$), 1^) G 7^ . (4.3) 

Consequently, the matrix element (v4|$), |^) can only be represented by (\Ef|A|$)* 
or by ($|At|\['). A frequently used example is given by 

\\A\m' = {A\^),A\^il))n = {\^),AU\^))n = {^\aU\^) , 
where the last expression is the only acceptable writing according to Dirac. 

• Lack of naturalness and simplicity: As indicated in section 3.1.1, one can do without 
the discussion of the dual space H* (the space of bras), since this one is isometric to 
Ti. Now Dirac's formalism makes a systematic use of Ti*. While we are accustomed 
to operators and matrices acting on "everything in front of them" , in this formalism 
one has to distinguish between the action of linear operators to the right and to the 



left [21 



yl(A|$) +/i|^)) = AA|$)+M|^) (A,^gC) 

(A($| +/i(^|)yl = A(<l>|A + /i(^|A , (4.4) 

which entails potential ambiguities concerning the domains of definition. Further- 
more, one has to change the natural order of vectors in certain expressions that are 



often used (e.g. compare equations ( |4.2| ) and (^73|)). 



^In order to avoid the discussion of domains of definition, we assume that A is a bounded operator: in 
this case, A and can be defined on the entire Hilbert space Ti.. 
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Changing computational rules: When passing from Ti. to L'^(R,dx) (which one is 
practically always obliged to do at a certain point, since physics is happening in 
configuration space), some of the calculational rules change: the operators of differ- 
entiation on L^(R, dx) only act to the right, their matrix elements can be written as 
{A{p,ip)x^2, ... and so on. 

Difficult mathematical interpretation in an abstract Hilhert space H: If one assumes 
(as we did) that the vectors belong to an abstract, infinite dimensional Hilbert space, 
then one recovers all of the problems mentioned in section 4.1. In this case, Dirac's 
bra and ket formalism represents a purely symbolic calculus and it is certainly 
not by chance that von Neumann did not look for an explanation or mathematical 
formulation of this approach when working out the mathematical foundations of 
quantum mechanics (see also |0). The modern introductions to this formalism 
try to render its mathematical content a bit more precise, but there exist only a few 
serious attempts which try to translate Dirac's approach into a rigorous mathematical 



theory, following an appropriate interpretation of it |2^, |^ . The resulting 

theories (most of which involve, from the beginning on, abstract Gelfand triplets and 
spectral families) are quite complicated and difficult to handle. 

Non pedagogical approach: The basic concepts of linear algebra (like linear maps, 
scalar products,...) are heavily used in all fields of physics (analytical mechanics, 
electrodynamics, relativity,...) with the standard mathematical notation and not 
with Dirac's formalism. Functional analysis based on the space L'^(R,dx) (or on 
I2) is a natural synthesis of linear algebra and real analysis and some notions of 
this theory are part of the standard mathematical luggage of any physicist (e.g. by 
means of Fourier analysis). On the other hand, Dirac's symbolic calculus sometimes 
conveys the impression of representing something qualitatively new and essentially 
unavoidable for the development of quantum mechanics^. 



For the finite dimensional Hilbert spaces involved in the description of the spin (or the 
angular momentum) of particles, one has Ti. ~ C" and Dirac's notation then represents 
a rewriting of vectors and a particular interpretation of the operations of standard linear 
algebra [|^. In this case, all expressions are mathematically well defined, but not all of 
the other inconveniences that we mentioned are discarded. 



4.2.2 Advantages 

The great power of Dirac's notation consists of the fact that it allows us to perform formal 
calculations which automatically lead to the correct form of the results. For instance, 

■^It may be worthwhile to recall that quantum theory has been developed without the use of this 
formalism |]37| and to note that its teaching can largely or completely do without it, as demonstrated by 
a fair number of excellent textbooks |4^, |49| . 
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insertion of the identity operator ( |3.8| ) between two operators A and B, 

/ oo \ oo 

{^\AB\<i/)^ = ' {^\AiY,\n){n\] B\^) = Y.{^\A\n){n\B\^) , (4.5) 



\n=0 / n=0 



immediately yields the right form of the final result without the need to contemplate 
the successive action of the maps \n) and {n\ described in equation ( p.9|) . Similarly, the 
projection P|<j,) on the state |$) G 7i simply reads 

and for its matrix elements, one readily gets 

(^|P|<j,)|^') = (^|<l>)(<l>|*') . (4.6) 

4.2.3 "Solution" of problems 

As we just emphasized, the notation |\1') for vectors and (\E'| for linear forms is quite 
useful for mnemonic and computational purposes. Thus, it would be out of place to avoid 
this notation and to do without its advantages. A good compromise which we will now 



summarize is the one adopted or mentioned in a certain number of texts |T^, ^ 

Since the concrete Hilbert space L'^(R,dx) is isomorphic to the abstract Hilbert space 
Ti, of Dirac's approach, we can "identify" them: by writing Ti = L'^(R,dx), we already 
avoid the mathematical complications of the invariant formalism (section 4.1). In any case 
- whether or not this identification is made - it is often convenient to use Dirac's bra and 
ket notation within wave mechanics. Henceforth, ones writes the wave functions as 
rather than -0 - or as ijj) as suggested by Dirac himself pf - in order to memorize the 
following relations which hold for any orthonormal basis {|v5n)}nGN of Iv^(R, dx): 

\Vn){fn\ = • 

riGN 

Here, the last relation means that 

= T.MM^) forall |^)eL2(R,cia;) 

neN 

or \ip{x)) = 'Y \(pn{x)){ipn\ip) for all xGR . 

nGN 

In the same vein, the projector on e i^^(R, dx) can be written as = 
For operators, it is convenient to use the notation ||21[ 

\Ai;)^A\^) , 
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while avoiding the interpretation of matrix elements which represents a source of 

ambiguities; a matrix element can then be written in any of the following forms: 



The insertion of the identity operator or of a projection operator are realized as in expres- 
sions (^]5|) and ( [4.6|) , respectively. 

Finally, the notation |n..m), instead of (fn..m, for the vectors of a Hilbert space basis 
indexed by n, ..,m is quite useful for writing matrix elements^, 



Hence, by allowing for some flexibility, one can benefit of the advantages of Dirac's notation 
while avoiding its inconveniences. 



■^However, beware of the fact that the resulting matrix represents a hnear operator on the infinite 
dimensional Hilbert space I2 so that one has to worry about its domain of definition: the existence of 
matrix realizations and their mathematical pitfalls are discussed in reference [B| . 




35 



Chapter 5 
Conclusion 



Let us try to draw some conclusions from the previous discussions, in particular for the 
teaching of quantum mechanics. 

Physics and mathematics are two different sciences and one can fully justify that a 
physicist's presentation does not take into account a perfect mathematical rigor even if 
the author completely masters this one. In physics, it probably is an art to use a mini- 
mum of mathematics while remaining precise enough in ones reasoning and presentation 
that a mathematical physicist can complete all technical details without ambiguities and 
thereby establish the results and their domain of validity in an irrefutable manner. For a 
quantum mechanics course, such an approach amounts to providing precise definitions in 
the beginning (for linear operators on L'^(R,dx)) while avoiding systematic discussions of 
mathematical details (domains of definition, distributions, ...) in the sequel. On the other 
hand, any approach which starts with a symbolic calculus that is quite difficult to render 
rigorous (and thus capable of precise conclusions) seems questionable. This is all the more 
true since the first approach is not more complicated and since it is based on a standard, 
well developed mathematical theory finding applications in many other fields of physics 
(dynamical systems, relativity, optics, ...). 

For the reader's convenience, we conclude with a short guide to the literature which only 
considers the presentation of the mathematical formalism and not the one of the physical 
applications. In view of the vast number of textbooks on the subject, such a list can only 
be incomplete and provide some hints for the interested reader. 

The physics books which do not follow the symbolic calculus approach (and which 
mention the domains of definition as well as the difference between Hermitian and self- 
adjoint operators) are not very numerous: let us cite the monographs [0 which are not 
based on the abstract Hilbert space formalism and which make a liberal use of Dirac's 
notation any time this appears to be beneficial (see also 0). Reference is a 
nice mathematically minded introduction to quantum mechanics which does not strive for 
an overly complete rigor. A presentation which is comparable, though more mathemat- 
ical and oriented towards the conceptual foundations, is given in while the treatises 



23, 24, 54, 47] can be qualified as belonging to the field of mathematical physics. Among 
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the textbooks |^ , those of Peebles, Schwabl and Messiah discuss in detail wave mechanics 
and its geometric structure before presenting the invariant formalism and Dirac's notation. 
Finally, we also mention some texts which avoid the invariant formalism and a rigid use 
of Dirac's notation, though they do not discuss the mathematical details concerning op- 
erators on L^(R, dx): apart from the 'classics' |Q, these are the elementary and modern 
introductions |^ which clearly present the principles of the theory while applying a strict 
minimum of useful mathematics. 



Acknowledgments 

I seize the opportunity to thank the superb teachers from whom I could learn quantum 
mechanics in Gottingen (H. Goenner, H. Roos, F. Hund (tl997), J. Yngvason, H. Reeh) 
and in Berkeley (G.W. Mackey). Thanks to A. Bilal for explaining a bit the invariant 
formalism to me upon my arrival in France! I express my gratitude to I. Laktineh, M. Ki- 
bler and R. Barbier for enduring my innumerable complaints, for all our discussions on 
quantum mechanics and for their comments on the text. Thanks to D. Maison, P. Breiten- 
lohner, S. Fleck, H. Grosse, H. Kiihn, R. McDermott, B.F. Samsonov and the very much 
regretted K. Baumann (flQQS) for their pertinent remarks on the examples! I would also 
like to express my gratitude to Benedicte Bruckert and Annabelle Pontvianne for numerous 
discussions on physics and mathematics. The revised version of the text owes at lot to the 
unknown referees and to their constructive remarks on the original presentation. 



37 



Appendix A 

There is no surprise 



The solution of the problems and apparent contradictions encountered in section 2.1 can be 
rephrased in the following way [|1^]: the theory of linear operators on infinite dimensional 



vector spaces is more complicated and interesting than the theory of finite order matrices. 
Let us now discuss the aforementioned problems while applying the mathematical results 
of section 2.2. 

(1) Suppose the commutation relation [P,Q] = — 1 is satisfied by operators P and Q 

acting on a Hilbert space 7i of finite dimension n (i.e. 7i ~ C^). In this case, P and Q 
can be realized by n x n matrices, the trace is a well defined operation and we obtain the 
result 

= Tr[P,g]©Tr(^l„) = yn . 

From this senseless result, one concludes that Heisenberg's relation cannot be realized on 
a finite dimensional Hilbert space. Thus, quantum mechanics has to be formulated on an 
infinite dimensional Hilbert space: on such a space, the trace is not anymore a well defined 
operation for all operators (in particular, the trace of the operator 1 does not exist) and 
therefore one can no more deduce a contradiction from Heisenberg's commutation relation 
in the indicated manner. 

An inconsistency can still be deduced in another way on an infinite dimensional Hilbert 



space by assuming that P and Q are both bounded operators |T^; accordingly, at least one 
of the operators P and Q satisfying Heisenberg's relation has to be unbounded and therefore 
this fundamental relation cannot be discussed without worrying about the domains of 
definition of operators. 

(2) As noted in section 2.2.2, the maximal domain of definition of the operator P = 
h d 

— —on the Hilbert space Iv^(R, dx) is 
1 dx 

I^max(P) = G L\K, dx) I ^' e L2(R, dx)} . 

The functions belonging to 2^max(-P) therefore enjoy certain regularity properties and their 
derivative is square integrable on R. In particular, these functions are continuous and their 
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limit for x ±00 is zero 0, this implies that the operator P, acting on T'max(-P), is 
Hermitian. The aforementioned function, which is unbounded at infinity, is differentiable, 
but its derivative is not square integrable and therefore it does not belong to I^max(-P)- 

Another choice of domain for P is given by the Schwartz space 5(R) C "Pmaxl-P)- In 
this case, the functions on which the operator P acts, do even have a rapid decrease at 
infinity. 

(3a) The Schwartz space iS(R) C i^^(R, dx) is an invariant domain of definition for the 
operators P and Q and thereby also for A = PQ^ + Q^P : 

A : S(R) — > 5(R) . 

Integration by parts shows that the so-defined operator A is Hermitian : 

{g, Af) = {Ag, f) for all /, g G V{A) = 5(R) . 



The function / given by ( |2.2| ) belongs to the Hilbert space L^(R, dx), but it does not 
belong to the domain of A, since it does not decrease more rapidly than the inverse of 
any polynomial at infinity: for instance, x^f{x) oc x'^''^ exp [— l/(4x^)] is not bounded for 
X +00. By way of consequence, h/i is not an eigenvalue of A. 

However h/i is an eigenvalue of the operator which has the same operating 
prescription as A. Before discussing this point, it is preferable to first consider the solution 
of the other problems. 

(4a) The astonishing results we referred to in this example, indicate that it is not enough 
to verify that an operator is Hermitian to identify it with an observable: we already dealt 
with this well-known fact in section 2.2.2 where we showed that the operator P and its 
adjoint act in the same way, but admit different domains of definition (given by (|2.18|) and 
(|2.21|), respectively). Thus, we established that P is Hermitian, but not self-adjoint. 



The results concerning the spectrum of P which are cited in this example, indicate that 
the spectrum of a Hermitian operator is not simply the set of its proper or generalized 
eigenvalues. Since this point is a more subtle one, we postpone its discussion. 

(5) The operator of multiplication by (p on the Hilbert space Ti = L^([0, 27r], c?(y9) is 
everywhere defined and self-adjoint: 

{9,^f) = {^9j) for all gJeH . 

h d 

The discussion in section 2.2.2 concerning the operator P = — —on L'^(\0, l],dx) applies 

i dx 

h d 

verbatim to the operator = — — on L'^{[0, 27r], d(p) : integration by parts yields 



f2TT 1 h da\ h r 1 

1^ d^ {gLJ - U M /)(^) = ^ [^7(27r)/(27r) - g{0)f{0)] for all / G V{L,) . (A.l) 
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Due to the periodic character of the polar angle, the functions belonging to the domain of 
Lz are periodic^ 

L. = -i- , V{Lz) = {fen\fen and /(O) = f{2n)} . (A.2) 
1 dip 

Accordingly, the surface term in (|A.1|) vanishes if and only if g{0) = g{2TT) : this implies 

that lI operates in the same way as Lz and that it admits the same domain, hence the 
operator ( |A.2|) is self-adjoint. 

In order to determine the domain of definition of the commutator [Lz, (f], we note that 
for any two operators A and B, we have 

V{A + B) = V{A)r]V{B) (A.3) 
V{AB) = {feViB)\BfeV{A)} . 

Thus, V{[Lz,(p]) = V{Lz(p)nV{ipLz) with 

vi^Lz) = {f eV{Lz)\Lzf ev{^)=n} = V{Lz) 
v{Lz^) = {f ev{^) = n\vf eV{Lz)} . 

But the function / = iff, which appears in the last expression, takes the values 

/(O) = (^/)(0) = 
f{2n) = ((p/)(27r) = 27r/(27r) 

and / G V{Lz) implies /(O) = /(27r), i.e. /(27r) = 0. 
In summary, 

V{^Lz) = V{Lz) 

ViLzif) = {f en\f en and /(27r) = 0} (A.4) 
Vi[Lz,ip]) = {f en\r en and /(O) = = /(27r)} . 

The eigenfunctions ipm{'^) = exp (irrup) of Lz do not belong to the domain of definition 

V 27r 



of since they do not vanish at the points and 27t : therefore the derivation ( |2.6| ) 

does not make any sense. 

(6) Let us consider two observables A and B (i.e. self-adjoint operators on the Hilbert 
space n) and a state ip (i-e- a unit vector belonging to n). The uncertainty relation for 



A, B is usually written in the form [|12 



A^A ■ A^B > ^ I B]ij) I , (A.5) 



^Concerning this point, we note that the use of polar coordinates assigns a distinguished role to the 
polar semi-axis ip — 0, while this axis is not privileged if one chooses other coordinate systems like Cartesian 
coordinates: therefore a discontinuity of wave functions on this axis (/(27r) — e'"/(0) with a ^ 0) has no 
raison d'etre. 
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where {A^Af = \\ {A - {A)^l) tpW^ with {A)^ = {tp^Aifj) and hkewise for B. Thus, the 
left-hand side of relation ( |A.5| ) is defined for ip G 'DlA) fl "D^B) (which is precisely the 
subspace of 7i containing all states ip for which the uncertainties A^A and A^i? both 
have a physical meaning). On the other hand, the right-hand side is only defined on the 
subspace B]) = T>{AB) fl T>{BA) which is much smaller in general. 

However, A, B being self-adjoint, relation ( |A.5| ) can be rewritten in the form 



A^A ■ A^B > ^ I i{Aij, Bij) - i{Bij, A^j) \ , (A.6) 

where the domain of the right-hand side now coincides with the one of the left-hand side, 
i.e. 'D{A) n T>{B). Thus, the product of uncertainties for two observables A and B is not 
determined by their commutator, but by the Hermitian sesquilinear formj^ 

$a,b(/, g) = i{Af, Bg) - i{Bf, Ag) for all f,ge V{A) n V{B) . 

The derivation of inequality ( |A.6[ ) is the same as the derivation of ( [A.5| ) (see for instance 



18| for the latter). It can be done in a few lines: let ^p G 'D(A) fl 'D(B) and let 

A = A-{A)^1 , B = B-{B)^1 ; 

by using the fact that A and B are self-adjoint and by applying the triangle as well the 
Cauchy-Schwarz inequalities, we obtain the inequality (|A.6|) : 

\i{Aip,Bip) -i{B^p,Aij) \ = \i{A'4j,B'4j) -i{Bip,Aij) \ 

< I {Aip.Bilj) I + I {Bip,A%lj) I = 2 I {Aij,l3tp) \ 

< 2\\A^\\ ■ \\B^|J\\ = 2A^A-A^B . 

Let us now show that this modification is not simply a cosmetic one. For A = P = 
fl d 

and B = Q = x on T-[ = L'^(R,dx), the right-hand side of inequality ( [A.6| ) is 



i dx 

easily evaluated using integration by parts; it implies the well-known uncertainty relation 

A^P ■ A^Q > ^ for ip e V{P) n V{Q). On the other hand, ioi A = = ^ and 
2 1 dip 

B = ip on Ti. = L^([0, 27r], (iyj), the surface term occurring upon integration by parts does 
not vanish and it leads to the uncertainty relation 

A^L,-A^<^>^ I l-27r|^(27r)n for all ^ G I)(L,) n I)((^) = P(L,) . (A.7) 

Thus, the product of the uncertainties A^Lz and A^yj may become smaller than h/2 - 
see Galindo and Pascual for an example. For states which belong to T>{[Lz,(p]), i.e. 
satisfy ip{2'7i) = 0, the inequality ( |A.5|) can also be applied and it yields the same result as 



^i.e. ^A,B{f,g) is linear with respect to g, antilinear with respect to / and <^A,B{g,f) = ^A,B{f,g)- 
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While the inequahty ( |A.7[ ) is mathematically correct, it is not acceptable in the present 
form from the point of view of physics: if one defines the average value and the uncertainty 
of the observable (y? by the usual formulae, then these expressions do not have the adequate 
transformation properties with respect to rotations, for which 



We refer to the literature [|T^, ^] for a slight modification of (|A.7|) which takes this problem 



into account, as well for estimates of the product A^L^ ■ A^Lp which do not explicitly 
depend on the particular state that one considers. Similar issues concerning the phase 
and number operators in quantum optics are discussed in reference 

(7) A purely formal solution of the problem can be obtained, if one considers the wave 
function to be defined on the entire real axis, rather than limiting oneself to the interval 
[—a, +a]. In fact, for the function ip defined by (|2.8| ), the discontinuity of ip" at the points 
X = ±a implies that ip"" is given by derivatives of Dirac's generalized function: 



= [^'(^ + ^) - '^'(^ - ^)] for X G R . 

Substitution of this expression in {ip, H'^ip) then leads to the same non- vanishing result as 
the evaluation of Y^^=i E^Pn- The mentioned inconsistency therefore originates from the 
fact that one has not properly taken into account the boundary conditions in the calculation 



In the sequel, we will show how a rigorous reasoning limited to the interval [—a, +a] 
allows to incorporate the boundary conditions and to confirm the non-vanishing result for 
{H"^)^. To start with, we define H and as self-adjoint operators on the Hilbert space 
H = L^{[-a,+a],dx). 

The infinite potential well is a mathematical idealization which is to be interpreted as 
the limit Vq — » cxo of a potential well of finite height Vq. For the latter, one finds that, 
outside the well, the wave functions of stationary states tend to zero if Vq ^ oo; by way of 
consequence, ip{±a) = is the appropriate boundary condition for the particle confined to 
the inside of the infinite well. Let us now study whether H is self-adjoint, when acting on 
sufficiently smooth functions satisfying -^(ia) = : by virtue of two integration by parts, 
we find 

2m J-a 

dx (p"{x)ip{x) + - — {ip'ip — (pil)'){x) 



2m J-a 2m 



= (i7ty,,^)_|l[^(a)V;'(a)-^(-a)^'(-a)] . 
2m 

Since we do not have any constraints on ilj'{±a), the surface term vanishes if and only if 
(f{±a) = 0. In summary, operates in the same way as H and the functions ip belonging 



2 
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to its domain of definition satisfy the same conditions as those belonging to the domain of 

H. Hence, the operator H = — acting on 7i, with the domain of definition 

2m dx^ 

V{H) = {^en I ip" en and ilj{±a) = 0} (A.8) 

represents a self-adjoint operator (i.e. an observable). Its spectrum, which has been made 
explicit in equation (p.lO|) , is discrete and non-degenerate and the associated eigenf unctions 
are given by 

— X for n = 2, 4, 6, ... 




for n = 1,3, 5, 



Accordingly, the spectral decomposition of H reads H = EnVn where Vn denotes the 
projector on the normed state ipn '■ Vn^J = {fn,i') ^n- 

By virtue of the spectral theorem , the operator is defined in terms of the spectral 
decomposition of if, 

oo 

H' = T.K'Pn , (A.9) 

n=l 

which implies that H'^^Pn = E'^Vn- In order to determine explicitly the domain of definition 
on which this operator is self-adjoint, one successively performs four integrations by parts: 

dx ^""{x)^{x) + -— {-ipip'" - if'lp" + ip"ip' - (x) 



4m2 J-a Am? 

The boundary conditions ip{±a) = = <f{^a) of the infinite well eliminate the first and the 
last contribution of the surface term. In order to annihilate the others, there are different 
possibilities, e.g. '?/''(±a) = = ip'{±a) or ip"{±a) = = ip"{±a). But, according to the 
definition ( |A.9| ) of H^, the eigenfunctions <^n of H must belong to the domain of H^: since 
these functions satisfy ip'^{±a) = 0, the domain of definition of the observable is 

ViH"^) = {iljen I ijj"" e n and ^(±a) = = <(±a)} . (A.IO) 

d^ 

We note that this represents but one way to render the operator — — on 

4m^ dx^ 

L^([— a, +a]) self-adjoint among many other possibilities (determined by other boundary 
conditions, e.g. ip{^a) = = ilj'{±a)); but it is the one corresponding to the physical 
system that we consider here. 

Let us now come to the paradox pointed out in our example. For i/j e V{H^) C V{H), 
the decomposition (|A.9| ) yields 

/ oo \ oo 

\ n=l I n=l 

oo 

= E^nPn (A.ll) 



n=l 
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with pn = I (v^n, '?/') If ^ € V{H), we can reach the same result in another way by using 
the fact that the projectors Vn are self-adjoint and orthogonal (i.e. VnVm = ^nmPn) '■ 

(oo oo \ oo 

n=l m=l I n,m=l 

oo 

n,m=l 
oo 

= E^nPn . (A.12) 



n=l 



The function ip(x) = vT5/(4a^/^) (a^ — x^) of our example does not satisfy 'ip"{±a) = 0, 
hence it does not belong to the domain of definition of H^: thus the expression (ip, H^ip) 
is not defined, because the variable that it involves is not simply characterized by its 
operating prescription, but also by its domain of definition. (In other words: although the 
integral in equation ( ^.91 ) is properly evaluated, it cannot be identified with {ip, H^ip) = 
{H'^)ij, for the function ijj considered here.) On the other hand, we have ip G 'D{H) and the 
average value {H'^)^ can be evaluated according to SJ^i-^'nPn equivalently, according 
to 

imr^ dx\m){x)\' = -— dx\r{x)\'- 



Am? J -a ^m?a'^ 

(4b) Let us now consider the second point mentioned in example 4, i.e. the spectrum of 
P. As indicated in section 2.2.2, the spectrum of an operator P, which is not self-adjoint, 
contains in general a part called residual spectrum: by definition, these are all numbers 
2; G C which are not eigenvalues of P, but for which z is an eigenvalue of pt. In the present 
example, the domains of P and pt are given by ( |2.18| ) and ( p.21| ), respectively; thus the 



functions belonging to ©(pt) do not satisfy any boundary conditions, while those belonging 
to T>[P) vanish at the boundaries x = and x = 1. Since the functions v'p(a;) = exp {—px) 
with p G C are solutions of the eigenvalue equation for pt, 

(Pt<^p)(x) =pMx) G P(pt) , fp^O) , (A.13) 

all complex numbers are eigenvalues of pt. But none of these is an eigenvalue of P, 
because ipp does not vanish on the boundaries and therefore does not belong to V^P). By 
way of consequence, the residual spectrum of P is C. In fact, this represents the complete 
spectrum of P since its discrete and continuous spectra are empty. 

This conclusion resolves the puzzle of example 4, but it leaves the physicist perplexed: 
since P is not self-adjoint, its spectrum does not admit a direct physical interpretation. 
However, we will show right away that it contains information which is important for 
physics. 

In order to study whether the domain of the Hermitian operator P can be enlarged in 
such a way that P becomes self-adjoint, it is suitable to apply von Neumann's theory of 



44 



defect indices |[: according to this theory, one has to study the complex eigenvalues 
of pt 

(and this is precisely what we just did). As a special case of (|A.13| ), we haveQ 

ptypi = ±iLp± with ifi±{x) = e^"^/^ , 

or (pt :p il)v5± = 0. Thus, the kernel of the operator pt ± il is a one-dimensional vector 
space: 
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_(P) = dim Ker (pt + il) = 1 

_(P) = dim Ker (pt - il) = 1 . (A. 14) 



The natural numbers n+(P) and n_{P) are called the deficiency or defect indices of P. 
Their usefulness is exhibited by the following result: 

Theorem 3 (Criterion for self-adjointness) Let A he a Hermitian operator with defi- 
ciency indices n+ and 

(i) A is self-adjoint if and only if = = n^. In this case (and only in this one), the 
spectrum of A is a subset of the real axis. 

(ii) A admits self-adjoint extensions (i.e. it is possible to render A self-adjoint by enlarging 
its domain of definition) if and only if = n_. If > and n_ > 0, the spectrum of A 
is the entire complex plane. 

(iii) // either n+ = 7^ n_ or n_ = ^ n^, the operator A has no nontrivial self-adjoint 
extension. Then, the spectrum of A is, respectively, the closed upper, or the closed lower, 
complex half-plane. 

In the case (ii), there exist explicit expressions for the possible self-adjoint extensions of 

A \m. 



In our example, we have n+ = n_ > ; therefore the operator P is not self-adjoint 
and its spectrum is the entire complex plane (what we are already aware of). The explicit 
expressions for the possible self-adjoint extensions of P that we just alluded to, are as 
follows |n]: for every real number a, the operator 



P^ = -4- , V{Pa) = {^pen\^' en and ^lj{0)=e'''^lj{l)} (A.15) 
1 dx 

is self-adjoint (as may easily be verified) and one has Sp Pa = R. 

From the point of view of physics, the boundary condition ip{0) = means that 

everything that leaves the interval [0, 1] on the right-hand side, again enters the interval 
on the left-hand side with a certain phase-shift (determined by a G R): this allows for the 
existence of states with a well-defined value of momentum, whereas the boundary condition 
^^(O) = = '?/'(l) excludes such states. For a = 0, one has periodic wave functions and one 
recovers the self-adjoint operator ( |A.2| ). 

■^For the present mathematical considerations, one ignores the dimensions of the eigenvalues which 
should coincide with the dimensions of the operators. 



45 



(3b) We now come back to the statement that h/i is an eigenvalue of At. For / 



©(A) = iS(R), integration by parts yields 



(^[(xV + x'g]j) + 2-^[x\gf){x) 



-oo 



(A.16) 



Since / decreases rapidly at infinity, the surface term on the right-hand side vanishes if 
the function g does not grow faster than a polynomial at infinity. Assuming g has this 
property, the previous equation implies that the operator At acts in the same way as A, 



At 



h 

9 = - 
1 



x-'g)' + x^g 



n r 

i 



3x^g + 2x^g' 



(A.17) 



though its domain is larger than 5(R): this domain contains all functions g which are 
such that expression ( |A.17| ) exists and is square integrable. (For all of these functions, the 
surface term in equation ([A.16|) vanishes automatically.) 

In summary, the domain of definition of At is larger than the one of A, thus the operator 
A is not self-adjoint. Moreover, the function (|2.2|) does not belong to T>{A), but it belongs 
to ©(At), so h/i is an eigenvalue of At. 

To conclude, we briefly investigate whether the domain of A can be enlarged so as to 
render A self-adjoint. For this problem, we again resort to von Neumann's theory. One 
easily checks that 



At 



9± = ±i 9± 



with 



9±{x) = \x\~ 
g40) = . 



-3/2 



exp 



'Ahx^ 



^ for X 7^ 



We have g^ G X'(At), but g^ ^ X'(At) (due to the exponential growth of g+ at the origin), 
hence 

n_(A) = dim Ker (At + il) = 1 
n+{A) = dimKer (At - il) = . 

From point (iii) of the last theorem, it now follows that there is no way to render the 
Hermitian operator A self-adjoint. 

Although the introduction of the residual spectrum looks, at first sight, like a gratuitous 
and unphysical complication, the last two examples show that it is quite interesting from 
the point of view of physics. In fact, the deficiency indices of A (which are closely related 
to the residual spectrum of A) allow us to tell whether a given Hermitian operator can be 
rendered self-adjoint; and for those A which can, von Neumann's theory also provides a 
simple and constructive method for determining all possible self-adjoint extensions, i.e. it 
explicitly describes all possible ways of turning a Hermitian operator into an observable. 

A more intuitive understanding of the last two examples can be obtained by looking 
at the potential eigenfunctions of the involved operators and at their admissibility for the 
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physical problem under consideration. For the momentum operator P on the interval [0, 1], 

the plane wave exp (ipx) (with p e R) formally solves the eigenvalue equation for P, but 

it is not compatible with the boundary conditions ip{0) = = of the infinite potential 

well; on the other hand, the function fx{x) oc exp ( ^^ ^ ) can formally be associated 

to the real eigenvalue \of A = PQ^ + Q^P, but it is not square integrable due to its singular 
behavior at the origin. Thus, the crucial constraints for turning Hermitian operators into 
observables come, respectively, from the boundary conditions for a problem on a compact 
(or semi-infinite) interval and from the condition of square integrability for a problem on 
the whole spaceQ. (As a matter of fact, these are the very same conditions which lead to 
the quantization of energy levels, respectively, on a finite interval and on the whole space.) 



^Actually, the lack of square integrability only points towards potential problems for self-adjointness 
and, to be more precise, we should say the following. The eigenfunctions associated with the continuous 
spectrum do not belong to the Hilbert space and thus are not square integrable (section 2.2.3). However, 
in the case of differential operators, these eigenfunctions enjoy certain smoothness properties (which imply 
some local square integrability) and what matters for the continuous spectrum, is that the singularities 

(n) 

of the functions are "mild enough". For instance, for n — 1,2,.., the potential eigenfunctions of 

An — PQ" + Q^P satisfy |/^"''(a;)| cx |a;|^"/^ and, for n > 3, they are too singular at a; = to allow for 
the self-adjointness of An', however, Ai and A2 are self-adjoint, when defined on appropriate domains. 
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